Progressive Data Integration and Semantic Enrichment Based on LinkedScales and Trails

نویسندگان

  • Matheus Silva Mota
  • Fagner Leal Pantoja
  • Júlio Cesar dos Reis
  • André Santanchè
چکیده

The integration of data elements scattered along different resources, with heterogeneous formats, can take advantage of an approach with progressive and lightweight steps, instead of pursuing costly upfront mappings. To support such approach, we defined a multiscale-based dataspace architecture, called LinkedScales, which carries an integration process via graph-based transformations over a graph database. A series of scales in the dataspace systematizes an integration and enrichment chain of steps to leverage transformation processes, which incrementally go from raw representations towards ontology-like structures. However, how to record and keep track of the intermediary outcomes in the integration chain remains an open research challenge. This article proposes combining the concept of scales with trails – lightweight, scale-specialized semantic annotations to enable progressive integration towards a semantic representation. We conduct experiments involving organism-centric analysis in life science to show the benefits of trails for transformation between scales.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Conceiving a Multiscale Dataspace for Data Analysis

A consequence of the intensive growth of information shared online is the increase of opportunities to link and integrate distinct sources of knowledge. This linking and integration can be hampered by different levels of heterogeneity in the available sources. Existing approaches focusing on heavyweight integration – e.g., schema mapping or ontology alignment – require costly upfront efforts to...

متن کامل

Multiscaling a Graph-based Dataspace

Biologists increasingly need a unified view to understand and discover relationships among data elements scattered along data sources with different levels of heterogeneity. Existing approaches usually adopt ad-hoc heavyweight integration strategies, requiring a costly upfront effort involving a monolithic chain of steps to handle specific formats/schemas, with low or no reuse. This article pro...

متن کامل

Adaptive Information Analysis in Higher Education Institutes

Information integration plays an important role in academic environments since it provides a comprehensive view of education data and enables mangers to analyze and evaluate the effectiveness of education processes. However, the problem in the traditional information integration is the lack of personalization due to weak information resource or unavailability of analysis functionality. In this ...

متن کامل

Adaptive Information Analysis in Higher Education Institutes

Information integration plays an important role in academic environments since it provides a comprehensive view of education data and enables mangers to analyze and evaluate the effectiveness of education processes. However, the problem in the traditional information integration is the lack of personalization due to weak information resource or unavailability of analysis functionality. In this ...

متن کامل

An Improved Semantic Schema Matching Approach

Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web [2] and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, we present a new impr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016